Inferring pandemic growth rates from sequence data
نویسندگان
چکیده
Using sequence data to infer population dynamics is playing an increasing role in the analysis of outbreaks. The most common methods in use, based on coalescent inference, have been widely used but not extensively tested against simulated epidemics. Here, we use simulated data to test the ability of both parametric and non-parametric methods for inference of effective population size (coded in the popular BEAST package) to reconstruct epidemic dynamics. We consider a range of simulations centred on scenarios considered plausible for pandemic influenza, but our conclusions are generic for any exponentially growing epidemic. We highlight systematic biases in non-parametric effective population size estimation. The most prominent such bias leads to the false inference of slowing of epidemic spread in the recent past even when the real epidemic is growing exponentially. We suggest some sampling strategies that could reduce (but not eliminate) some of the biases. Parametric methods can correct for these biases if the infected population size is large. We also explore how some poor sampling strategies (e.g. that over-represent epidemiologically linked clusters of cases) could dramatically exacerbate bias in an uncontrolled manner. Finally, we present a simple diagnostic indicator, based on coalescent density and which can easily be applied to reconstructed phylogenies, that identifies time-periods for which effective population size estimates are less likely to be biased. We illustrate this with an application to the 2009 H1N1 pandemic.
منابع مشابه
Inferring Disease Contact Networks from Genetic Data
The analysis of genetic sequence data collected during disease outbreaks has emerged as a promising new tool for understanding infectious disease dynamics and designing control measures against infectious diseases. Hence, there is a need for statistical methodologies that effectively integrate genetic data with other epidemiological data to perform inference on the underlying disease dynamics. ...
متن کاملMortality from the influenza pandemic of 1918–19 in Indonesia
The influenza pandemic of 1918-19 was the single most lethal short-term epidemic of the twentieth century. For Indonesia, the world's fourth most populous country, the most widely used estimate of mortality from that pandemic is 1.5 million. We estimated mortality from the influenza pandemic in Java and Madura, home to the majority of Indonesia's population, using panel data methods and data fr...
متن کاملThe Systemic Imprint of Growth and Its Uses in Ecological (Meta)Genomics
Microbial minimal generation times range from a few minutes to several weeks. They are evolutionarily determined by variables such as environment stability, nutrient availability, and community diversity. Selection for fast growth adaptively imprints genomes, resulting in gene amplification, adapted chromosomal organization, and biased codon usage. We found that these growth-related traits in 2...
متن کاملThe Hiv/aids Pandemic in South Africa: Sectoral Impacts and Unemployment
South Africa is currently confronting an HIV/AIDS crisis. HIV prevalence in the population is currently estimated at about 13% with that number projected to increase over the next five years or so. Given the massive scale of the problem and the concentration of effects on adults of prime working age, the pandemic is expected to sharply influence a host of economic and non-economic variables. Wh...
متن کاملMortality and transmissibility patterns of the 1957 influenza pandemic in Maricopa County, Arizona
BACKGROUND While prior studies have quantified the mortality burden of the 1957 H2N2 influenza pandemic at broad geographic regions in the United States, little is known about the pandemic impact at a local level. Here we focus on analyzing the transmissibility and mortality burden of this pandemic in Arizona, a setting where the dry climate was promoted as reducing respiratory illness transmis...
متن کامل